Abstract

The image distortions of high-redshift galaxies caused by gravitational light deflection of 
foreground clusters of galaxies can be used to reconstruct the two-dimensional surface 
mass density of these clusters. We apply an unbiased parameter-free reconstruction
technique to the cluster CL0939+4713 (Abell 851), observed with the WFPC2 on board
the HST. We demonstrate that a single deep WFPC2 observation can be used for cluster
mass reconstruction despite its small field of view and the irregular shape of the data 
field (especially for distant clusters). For CL0939, we find a strong correlation between 
the reconstructed mass distribution and the bright cluster galaxies indicating that mass 
follows light on average. The detected anti-correlation between the faint galaxies and 
the reconstructed mass is most likely an effect of the magnification (anti)bias, which
was detected previously in the cluster A1689. Because of the high redshift of CL0939
($z_d = 0.41$), the redshift distribution of the lensed, faint galaxies has to be accounted for
in the reconstruction technique. We derive an approximate global transformation for the 
surface mass density which leaves the mean image ellipticities invariant, resulting in an 
uncertainty in the normalization of the mass. From the non-negativity of the surface mass
density, we derive lower limits on the mass inside the observed field of $0.75\,(h_{50}^{-1}\,\mathrm{Mpc})^2$,
ranging from $M > 3.6\times10^{14}\,h_{50}^{-1}\,M_\odot$ to $M > 6.3\times10^{14}\,h_{50}^{-1}\,M_\odot$ for a mean redshift
of $\langle z\rangle = 1$ to $\langle z\rangle = 0.6$ of the faint galaxy images with $R \in (23, 25.5)$. However, we
can break the invariance transformation for the mass using the magnification effect on
the observed number density of the background galaxies. Assuming a mean redshift of
$\langle z\rangle = 0.8$ and a fraction of $x = 15\%$ ($x = 20\%$) of cluster galaxies in the observed galaxy
sample with $R \in (23, 25.5)$, we obtain for the mass inside the field $M \approx 5\times10^{14}\,h_{50}^{-1}\,M_\odot$
($M \approx 7\times10^{14}\,h_{50}^{-1}\,M_\odot$), which corresponds to $M/L \approx 100\,h_{50}$ ($M/L \approx 140\,h_{50}$).
1. Introduction



Since the pioneering work of Tyson, Valdes and Wenk (1990) it has been realized that the
weak shear which clusters imprint on the images of faint background galaxies can be
used to obtain the mass distribution of these lensing clusters (for a recent review on cluster
lensing, see Fort & Mellier 1994; see also Kochanek 1990; Miralda-Escudé 1991). Kaiser &
Squires (1993, hereafter KS) have derived an explicit expression for the two-dimensional
surface mass density as a function of the shear (or tidal gravitational field) caused by
the cluster, which in turn can be obtained from the distorted images of background
galaxies. This inversion method has been applied to several clusters observed from the
ground (Fahlman et al. 1994, Smail et al. 1995a, Kaiser et al. 1995), demonstrating the
applicability of this new method to determine mass profiles and total mass estimates of
clusters. The detection of sheared images far out in the cluster Cl 0024+16 (Bonnet, Mellier
& Fort 1994; Kassiola et al. 1994) shows that weak lensing can probe previously
unexplored regions in clusters.

Recently, the KS inversion technique has been modified and generalized to account 
for strong lensing, as it should occur near the center of clusters (Schneider & Seitz 
1995, Seitz & Schneider 1995a, Kaiser 1995), and for the finite data field defined by 
the CCD size (Schneider 1995, Kaiser et al. 1995, Bartelmann 1995, Seitz & Schneider 
1995b, henceforth SS). In SS, a detailed quantitative comparison between the various 
inversion techniques has been made, and it was demonstrated that the inversion formula 
derived in SS is the most accurate of the unbiased ones. In particular, if the cluster mass 
distribution is significantly more extended than the data field (i.e. the CCD), the SS 
inversion formula is significantly more accurate than the other currently known inversion 
techniques. 

Such a situation generally occurs if the data are taken with the WFPC2 on board 
the Hubble Space Telescope (HST), owing to its fairly small field of view. Hence, if 
WFPC2 images are used for the reconstruction of the surface mass density of a cluster, 
it is necessary to use a finite-field inversion formula such as the one derived in SS. As
was pointed out in Schneider & Seitz (1995), even then the mass density can be derived
only up to a global invariance transformation, which is the mass-sheet degeneracy found
by Gorenstein et al. (1988). The invariance transformation may be broken if magnification
effects are taken into account. Magnification changes the local number density of images
of an appropriately chosen subset of faint galaxies (Broadhurst, Taylor & Peacock 1995),
and it changes the size of galaxy images at fixed surface brightness, which is itself
unchanged by gravitational light deflection (Bartelmann & Narayan 1995). In particular,
this latter paper demonstrates that the inclusion of magnification effects may improve the
cluster mass inversion considerably, and can also provide a unique means to determine
the redshift distribution down to very faint magnitudes.

In this paper we present the first application of a finite-field cluster inversion to the
deep WFPC2 observation (10 orbits) of the distant cluster CL 0939+4713 retrieved from
the HST archive. These data have been used for the study of the morphology of the
cluster galaxies and the Butcher-Oemler effect by Dressler et al. (1994a). In Sect. 2 we
briefly describe the data, and discuss the image identification and the determination of
the image shapes, which are used to estimate the local image distortion. Sect. 3
briefly summarizes the inversion method, the results of which are presented in Sect. 4
and discussed in Sect. 5.
2. Observation and data analysis



Fig. 1a. Observations of the cluster CL0939+4713 obtained with WFPC2 using the F702W filter and an
exposure time of 21000 s. The side length of the data field is 2'.5 ($1\,h_{50}^{-1}$ Mpc; for an EdS universe with
$H_0 = 50\,h_{50}$ km/s/Mpc, 1 arcsec on the sky represents $6.51\,h_{50}^{-1}$ kpc). Dressler & Gunn (1992) propose
the cluster center to be close to the 3 bright galaxies in the upper left corner of the lower left CCD.
North is at the bottom, east to the right

Cl0939+4713 was observed in January 1994 with the WFPC2 camera on the Hubble
Space Telescope (Dressler et al. 1994a). The observation consists of 10 single orbits of
2100 seconds each (a total exposure time of 5h50min) and is probably the
deepest cluster observation done with the HST/WFPC2. The exposures were divided
into two groups of 5, with a shift of 10 pixels East and 20 pixels South between the two
groups. After STScI pipeline processing, the data were shifted and combined to remove
cosmic rays and hot pixels using standard STSDAS/IRAF routines. A mosaic of the 3
WFC chips and the PC chip was constructed; because of the smaller pixel size of
the PC chip, and therefore its brighter isophotal limit, we discard the PC chip from the analysis.
The image was then run through the SExtractor package (Bertin & Arnouts 1995) to
detect objects and to measure their magnitudes, mean isophotal surface brightnesses and second
moments. All objects with isophotal areas larger than 12 pixels and brighter than $2\sigma$ per pixel
($\mu_{F702W} = 25.3$ mag/arcsec$^2$) were selected. For each object the unweighted first and
second moments were computed to determine its center, size, ellipticity
and orientation. To convert instrumental F702W magnitudes into standard $R$ we use
the synthetic zero point and color corrections listed in Holtzman et al. (1995). For the
color term we choose $(V - R) \approx 0.6$, typical of a $z \approx 0.8$ late-type spiral. The color
correction is then +0.2 mag, and remains small for other choices of the color term. The
typical photometric errors of our faintest objects, $R < 26.5$, are $\delta R \approx 0.1$-$0.2$. A neural
network algorithm was used to identify stellar objects; 22 were detected. A
galaxy catalogue was then constructed with a total of 572 galaxies down to $R = 26.5$.

Fig. 1a shows the full WFPC2 image of the cluster, and Fig. 1b a zoomed image of the
region marked in Fig. 1a. A detailed inspection revealed an arc candidate and a likely
pair in this central cluster region. These strong lensing features suggest that the cluster
is over-critical and probably mark the densest part of the cluster. Spectroscopic
observation of the bright pair ($R = 22.5$, and $R = 22.9$ for its counter-image candidate)
will confirm or refute the lensing assumption. If it is indeed a gravitational pair, it
will strongly constrain the mass distribution of the very central part of the cluster.



Fig. 1b. A zoomed image of the region marked in Fig. 1a. We find an arc candidate A0 and a likely
gravitational pair P1 & 2 with the counter-image P3



Fig. 2 shows the number vs. magnitude diagram of the galaxies detected in the field
(solid line). These numbers are compared with the field galaxy counts in the $R$ band
from Smail et al. (1995b). It is clear that most of the galaxies with $R \in (19, 22)$ are likely
cluster members. Furthermore, there is a likely contamination from cluster members of
about 150 objects down to $R = 25.5$ within the WFC field; the cluster contamination is
expected to be higher in the central part than in the outer part.






Fig. 2. The number vs. magnitude diagram of all galaxies detected within the WFC field (solid line).
The dotted line shows the number counts, rescaled to the area of the WFC field, from Smail et al. (1995b),
which yield $N(R) \propto 10^{\gamma R}$, with $\gamma = 0.32$ and normalization such that $N(R < 27) = 7.3\times10^{5}$ per
square degree. Assuming that the dotted line represents the counts of the faint galaxies, the dash-dotted
histogram gives the number counts versus magnitude of the cluster galaxies



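From the second moments of light $Q_{ij}$ we calculate for each galaxy image the (complex)
ellipticity

$$\chi = \frac{Q_{11} - Q_{22} + 2\mathrm{i}\,Q_{12}}{Q_{11} + Q_{22}} = |\chi|\,e^{2\mathrm{i}\varphi} . \qquad (2.1a)$$

However, it is more convenient to work in terms of the ellipticity parameter $\epsilon$, which has
the same phase as $\chi$, and modulus

$$|\epsilon| = \frac{1-r}{1+r} \quad\text{with}\quad r = \sqrt{\frac{1-|\chi|}{1+|\chi|}} . \qquad (2.1b)$$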
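The conversion from second moments to the two ellipticity measures of Eqs. (2.1a) and (2.1b) can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical and the example moments are invented, not taken from the galaxy catalogue.

```python
import math

def chi_from_moments(q11, q22, q12):
    """Complex ellipticity chi of Eq. (2.1a) from the second moments Q_ij."""
    return complex(q11 - q22, 2.0 * q12) / (q11 + q22)

def epsilon_from_chi(chi):
    """Ellipticity epsilon: same phase as chi, modulus (1 - r)/(1 + r), Eq. (2.1b)."""
    modulus = abs(chi)
    if modulus == 0.0:
        return 0j  # a round image has no preferred orientation
    r = math.sqrt((1.0 - modulus) / (1.0 + modulus))  # axis ratio of the equivalent ellipse
    return (1.0 - r) / (1.0 + r) * chi / modulus

# Invented example: an image elongated along the x-axis with axis ratio 1:2,
# so that Q11 : Q22 = 4 : 1 and Q12 = 0.
chi = chi_from_moments(q11=4.0, q22=1.0, q12=0.0)
eps = epsilon_from_chi(chi)
print(chi, eps)  # chi is about 0.6, epsilon about 1/3, both real
```

For this example $r = 0.5$ is recovered as the axis ratio, which is why $|\epsilon| = (1-r)/(1+r)$ is often the more intuitive of the two measures.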

Then we use the ellipticities $\epsilon_k$ of the galaxy images at positions $\theta_k$ to calculate the local mean
image ellipticity on a grid $\theta_{ij}$,

$$\bar\epsilon(\theta_{ij}) = \frac{\sum_k u(|\theta_{ij}-\theta_k|)\,\epsilon_k}{\sum_k u(|\theta_{ij}-\theta_k|)} , \qquad (2.2)$$

with the weight factor

$$u(d) = \exp(-d^2/s^2) ,$$

with a smoothing length $s$, which, unless noted otherwise, is chosen as $s = 0'.3$ ($117\,h_{50}^{-1}$ kpc).
The resulting map of $\bar\epsilon(\theta)$ is shown in Fig. 3 using all images covering more
than 12 pixels (pixel size 0".1) and using four different magnitude cuts for the galaxies:
$R \in (24, 25.5)$ for the upper left panel, $R \in (23, 25.5)$ for the upper right, $R \in (22, 25.5)$
for the lower left and $R \in (21, 25.5)$ for the lower right. The corresponding numbers of
galaxies used for constructing the shear maps are 226, 295, 343 and 383, respectively,
meaning that the average number of galaxies having a distance of less than the smoothing
length from the point $\theta_{ij}$ is about 13, 17, 20 and 22. The faint magnitude cut was
chosen to limit contamination by the circularization that arises when small galaxies are
measured at poor signal-to-noise. We find that the "shear field" is quite robust
under adding brighter galaxies to the sample: the direction of the local shear vector is
almost unchanged and its absolute value is decreased on average for the brighter galaxy
samples, especially in regions close to the cluster center. The reason for this is that the
modulus of the expectation value of the mean image ellipticities is smaller for background
galaxies closer to the cluster and it is zero for cluster and foreground galaxies, both
leading to a decrease in the mean image ellipticities for the brighter galaxy samples. The
direction of the expectation value of the mean image ellipticities is not changed, since the
mean image orientation of the background galaxies does not depend on their distances
to the cluster, and since cluster and foreground galaxies show no preferred alignment at
all.
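The local averaging described above, a weighted mean of the image ellipticities with the Gaussian weight $u(d) = \exp(-d^2/s^2)$, can be sketched as follows. The function name and the two-galaxy toy catalogue are assumptions made for illustration; positions and $s$ are in arcminutes.

```python
import math

def mean_ellipticity(theta, positions, ellipticities, s=0.3):
    """Weighted mean ellipticity at grid point theta.

    positions     : list of (x, y) galaxy positions
    ellipticities : list of complex image ellipticities, same order
    s             : smoothing length of the weight u(d) = exp(-d^2 / s^2)
    """
    numerator = 0j
    denominator = 0.0
    for (x, y), eps in zip(positions, ellipticities):
        d2 = (x - theta[0]) ** 2 + (y - theta[1]) ** 2
        u = math.exp(-d2 / s ** 2)
        numerator += u * eps
        denominator += u
    return numerator / denominator

# Toy catalogue (invented): one galaxy at the grid point, one a smoothing length away.
positions = [(0.0, 0.0), (0.3, 0.0)]
ellipticities = [0.2 + 0.0j, 0.0 + 0.4j]
local_mean = mean_ellipticity((0.0, 0.0), positions, ellipticities)
print(local_mean)
```

Because cluster and foreground galaxies enter the sum with randomly oriented ellipticities, adding them dilutes the modulus of the mean without rotating it, which is the behaviour described for the brighter samples.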
Fig. 3. The orientation and absolute value of the local mean image ellipticities $\bar\epsilon = |\bar\epsilon|\,(\cos 2\varphi + \mathrm{i}\sin 2\varphi)$
of galaxies with $R \in (24, 25.5)$ (upper left), $R \in (23, 25.5)$ (upper right), $R \in (22, 25.5)$ (lower left) and
$R \in (21, 25.5)$ (lower right). We exclude galaxies covering less than 12 pixels (pixel size 0".1). We choose
a smoothing length of $s = 0'.3$; as is clearly seen, the 'coherence length' of the shear pattern is larger
than this smoothing length. The vectors displayed include an angle of $\varphi$ with the x-axis, and a mean
image ellipticity $|\bar\epsilon| = 1$ would correspond to a vector of length 0'.4
3. Method of Reconstruction



In this section we briefly describe the reconstruction of the cluster surface mass density
using the observed map of mean image ellipticities. Due to the high redshift of the cluster
($z_d = 0.41$) we cannot assume that all source galaxies are at the same effective distance
to the cluster, i.e. that their $D(z_d, z)/D(z)$ is the same. Therefore, we relate the critical
surface mass density

$$\Sigma_{\rm crit}(z) = \frac{c^2\,D(z)}{4\pi G\,D(z_d)\,D(z_d,z)}$$

for a source at redshift $z$ to the critical surface mass density for a source at 'infinity' by
defining $w(z)$ through

$$w(z)\,\Sigma_{\rm crit}(z) = \lim_{z\to\infty}\Sigma_{\rm crit}(z) ,$$

for $z > z_d$, and $w(z) = 0$ for $z \le z_d$, and obtain for the dimensionless surface mass
density $\kappa(\theta, z)$ and the shear $\gamma(\theta, z)$

$$\kappa(\theta,z) = w(z)\,\kappa_\infty(\theta) , \qquad \gamma(\theta,z) = w(z)\,\gamma_\infty(\theta) . \qquad (3.1)$$

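The form of $w(z)$ depends on the geometry of the universe, and for an Einstein-de Sitter
universe we have

$$w(z) = \begin{cases} 0 & \text{for } z \le z_d , \\[4pt] \dfrac{\sqrt{1+z}-\sqrt{1+z_d}}{\sqrt{1+z}-1} & \text{for } z > z_d . \end{cases} \qquad (3.2)$$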
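As a quick numerical illustration of the Einstein-de Sitter weight, the sketch below evaluates $w(z) = (\sqrt{1+z}-\sqrt{1+z_d})/(\sqrt{1+z}-1)$ for $z > z_d$, which follows from the EdS angular-diameter distances. The function name is hypothetical, and the sample redshifts are simply the mean redshifts considered later in the text.

```python
import math

Z_D = 0.41  # cluster redshift quoted in the text

def w_eds(z, z_d=Z_D):
    """Relative lensing strength of a source at redshift z (Einstein-de Sitter)."""
    if z <= z_d:
        return 0.0  # foreground and cluster galaxies are not lensed
    root = math.sqrt(1.0 + z)
    return (root - math.sqrt(1.0 + z_d)) / (root - 1.0)

for z in (0.6, 0.8, 1.0, 1.5):
    print(f"z = {z:.1f}  w = {w_eds(z):.3f}")
```

The weight rises steeply just behind the cluster and approaches 1 only slowly, which is why the assumed source redshift distribution matters for a lens at $z_d = 0.41$.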

The following description of the reconstruction of the surface mass density is based
on the simplifying assumption that the cluster is not critical, i.e., $(1 - w\kappa_\infty)^2 - w^2|\gamma_\infty|^2 > 0$
for all sources. We point out, however, that all the mass maps shown in this
paper have been calculated without this assumption, using the more complicated method
described in Seitz & Schneider (1995c). Since the reconstruction is much easier
to describe for non-critical clusters, we describe only that case here.
We also note that the results change only slightly if this
assumption is introduced.

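As described in Seitz & Schneider (1995c), the local expectation value of the image
ellipticity can be approximated through

$$\langle\epsilon\rangle \approx \frac{\langle w\rangle\,\gamma_\infty}{1 - f\,\langle w\rangle\,\kappa_\infty} \qquad (3.3)$$

if $\kappa_\infty \lesssim 0.8$ and if the mean redshift of the sources is $\langle z\rangle \gtrsim 0.7$ for this particular cluster
redshift. In (3.3) we used the definitions

$$\langle w^n\rangle = \int_0^\infty dz\; p_s(z)\, w^n(z) \quad\text{and}\quad f = \frac{\langle w^2\rangle}{\langle w\rangle^2} , \qquad (3.4)$$

where $p_s(z)$ is the redshift distribution of the sources. From (3.3) we find that the
transformation $\kappa_\infty \to \kappa'_\infty$ with

$$1 - f\langle w\rangle\,\kappa'_\infty = \lambda\,\left[1 - f\langle w\rangle\,\kappa_\infty\right] , \qquad (3.5)$$

which implies that $\gamma'_\infty = \lambda\gamma_\infty$, leaves the image ellipticities unchanged. Therefore,
using only image ellipticities for the reconstruction, $1 - f\langle w\rangle\kappa_\infty$ can be derived only
up to a multiplicative constant. Using the relation between the gradient of the surface
mass density and the derivatives of the shear (Kaiser 1995),
$\nabla\kappa_\infty = (\gamma_{\infty 1,1} + \gamma_{\infty 2,2},\; \gamma_{\infty 2,1} - \gamma_{\infty 1,2})$, we obtain from (3.3) with
$K(\theta) := \ln[1 - f\langle w\rangle\kappa_\infty(\theta)]$

$$\nabla K = \frac{-f}{1 - f^2|\langle\epsilon\rangle|^2} \begin{pmatrix} 1 - f\langle\epsilon_1\rangle & -f\langle\epsilon_2\rangle \\ -f\langle\epsilon_2\rangle & 1 + f\langle\epsilon_1\rangle \end{pmatrix} \begin{pmatrix} \langle\epsilon_1\rangle_{,1} + \langle\epsilon_2\rangle_{,2} \\ \langle\epsilon_2\rangle_{,1} - \langle\epsilon_1\rangle_{,2} \end{pmatrix} \equiv \mathbf{u}(\theta) , \qquad (3.6)$$

with $\langle\epsilon\rangle = \langle\epsilon_1\rangle + \mathrm{i}\,\langle\epsilon_2\rangle$ and gradients $\langle\epsilon_i\rangle_{,j} = \partial\langle\epsilon_i\rangle/\partial\theta_j$. Since the mean image ellipticity
$\bar\epsilon$ provides an unbiased estimator of the expectation value $\langle\epsilon\rangle$, we set $\langle\epsilon\rangle \approx \bar\epsilon$. Then $\mathbf{u}(\theta)$
can be determined from observations, for an assumed value of $f$, which characterizes the
redshift distribution of the sources. From that, $K(\theta)$ is obtained as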

$$K(\theta) = \int_U d^2\theta'\; \mathbf{H}(\theta,\theta')\cdot\mathbf{u}(\theta') + \bar K . \qquad (3.7)$$

In Eq. (3.7), the kernel $\mathbf{H}(\theta,\theta')$ is calculated for the data field $U$ according to the method
suggested by Seitz & Schneider (1995b). So far we have not calculated the kernel $\mathbf{H}$ for
the irregularly shaped WFPC2 field. Therefore, we reconstruct $K$ on two rectangular
fields with side lengths of about 2'.5 x 1'.25 and 1'.25 x 2'.5. Since the additive
constant is free, we shift one of the resulting $K$-maps such that the mean of $K$ inside the
overlapping region of the two data fields is the same. The resulting mass map is then
obtained by joining these two independent reconstructions at the diagonal of the
lower right CCD. All mass maps shown here therefore display a discontinuity at
this diagonal; however, the jump across this line is always remarkably small, which indicates
the relative uncertainty of the reconstruction.
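The step of fixing the free additive constant between the two rectangular reconstructions can be illustrated with a small sketch. It assumes, purely for the example, that each $K$-map is stored as a dictionary from pixel index to value; the function name and the numbers are invented.

```python
def match_additive_constant(k_map_a, k_map_b, overlap):
    """Shift map B so that its mean over the overlap pixels equals that of map A.

    k_map_a, k_map_b : dicts mapping pixel index -> K value
    overlap          : pixel indices covered by both maps
    Returns the shifted copy of map B and the applied shift.
    """
    mean_a = sum(k_map_a[p] for p in overlap) / len(overlap)
    mean_b = sum(k_map_b[p] for p in overlap) / len(overlap)
    shift = mean_a - mean_b
    return {p: value + shift for p, value in k_map_b.items()}, shift

# Invented toy maps sharing two pixels.
k_a = {(0, 0): 0.10, (0, 1): 0.20, (1, 1): 0.30}
k_b = {(0, 1): 0.50, (1, 1): 0.70, (1, 2): 0.90}
shared = [(0, 1), (1, 1)]
k_b_shifted, shift = match_additive_constant(k_a, k_b, shared)
print(shift, k_b_shifted)
```

After the shift the two maps agree in the mean over the overlap but not pixel by pixel, so any residual jump along the joining line is a direct measure of how consistent the two independent reconstructions are.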

The redshift distribution of the field galaxies down to the faint magnitude limits
considered here is poorly known. Redshift surveys of considerably brighter galaxies
indicate that the redshift distribution is fairly broad, and a high-redshift tail cannot be
excluded (see Lilly 1993, Colless et al. 1993, and Cowie et al. 1995, and references therein).
We therefore take the same parameterization of $p_s(z)$ as used in Brainerd, Blandford &
Smail (1995),

$$p_s(z) = \frac{\beta\, z^2}{\Gamma(3/\beta)\, z_0^3}\, \exp\left[-(z/z_0)^\beta\right] . \qquad (3.8)$$

We consider different values of the parameter $\beta$, for which the mean redshift is given
through $\langle z\rangle = z_0\,\Gamma(4/\beta)/\Gamma(3/\beta)$. In Fig. 4 we show the distribution $p_s(z)$ for $\beta = 1, 1.5, 3$
and $\langle z\rangle \in \{0.8, 1.0, 1.5\}$ (right panels) and the moments $\langle w\rangle$, $\langle w^2\rangle$ and $f = \langle w^2\rangle/\langle w\rangle^2$
as a function of the mean redshift $\langle z\rangle$ for the redshift $z_d = 0.41$ of CL 0939+4713 (left
panels). Kneib et al. (1995) used the cluster lens Abell 2218 to estimate the
redshift distribution of the faint galaxies and found a mean value of $\langle z\rangle \approx 0.8$
for $23.5 < R < 25.5$, which is above but consistent with the no-evolution expectation.
Therefore, our choice of the parametrization of the redshift distribution should be close to
the true distribution of faint galaxies.
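The moments $\langle w\rangle$, $\langle w^2\rangle$ and the ratio $f$ can be evaluated numerically for any choice of $\beta$ and $\langle z\rangle$. The sketch below assumes the Einstein-de Sitter form of $w(z)$ and uses a plain midpoint rule; the function names, the integration limit and the step count are illustrative choices, not values from the paper.

```python
import math

def p_s(z, z0, beta):
    """Source redshift distribution p_s(z) proportional to z^2 exp[-(z/z0)^beta]."""
    return beta * z ** 2 / (math.gamma(3.0 / beta) * z0 ** 3) * math.exp(-((z / z0) ** beta))

def z0_for_mean(mean_z, beta):
    """Invert <z> = z0 * Gamma(4/beta) / Gamma(3/beta) for z0."""
    return mean_z * math.gamma(3.0 / beta) / math.gamma(4.0 / beta)

def w_eds(z, z_d):
    """Einstein-de Sitter lensing weight; zero for sources in front of the cluster."""
    if z <= z_d:
        return 0.0
    root = math.sqrt(1.0 + z)
    return (root - math.sqrt(1.0 + z_d)) / (root - 1.0)

def source_moments(mean_z, beta, z_d=0.41, z_max=20.0, steps=20000):
    """Return (normalisation, <w>, <w^2>, f) using the midpoint rule on [0, z_max]."""
    z0 = z0_for_mean(mean_z, beta)
    dz = z_max / steps
    norm = w1 = w2 = 0.0
    for i in range(steps):
        z = (i + 0.5) * dz
        weight = p_s(z, z0, beta) * dz
        w = w_eds(z, z_d)
        norm += weight
        w1 += weight * w
        w2 += weight * w * w
    return norm, w1, w2, w2 / w1 ** 2

for mean_z in (0.8, 1.0, 1.5):
    norm, w1, w2, f = source_moments(mean_z, beta=1.0)
    print(f"<z> = {mean_z:.1f}: <w> = {w1:.3f}, <w^2> = {w2:.3f}, f = {f:.3f}")
```

Since $\langle w^2\rangle \ge \langle w\rangle^2$ for any distribution, $f \ge 1$ always holds, and $f$ exceeds 1 more strongly the larger the fraction of sources with $w = 0$, i.e. the lower the mean source redshift.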



4. Results 
Fig. 4. The redshift distribution $p_s(z)$, defined in Eq. (3.8), is shown in the right panels for a mean
redshift of $\langle z\rangle = 0.8$ (top), $\langle z\rangle = 1$ (middle) and $\langle z\rangle = 1.5$ (bottom), and for the parameters $\beta = 1$ (solid
line), $\beta = 1.5$ (dotted line) and $\beta = 3$ (dashed line). The left panels show the moments $\langle w\rangle$, $\langle w^2\rangle$ and
the ratio $f = \langle w^2\rangle/\langle w\rangle^2$ defined in Eq. (3.4) as a function of mean redshift $\langle z\rangle$ for a cluster redshift
$z_d = 0.4$

4.1 The reconstructed mass distribution 

In Fig. 5 we show the reconstructed surface mass density for different mean redshifts
$\langle z\rangle$, using in each case galaxies with $R \in (23, 25.5)$ and the invariance transformation
(3.5) such that the minimum of the resulting $\kappa_\infty$-map is roughly zero to avoid unphysical
negative surface mass densities. We see that for a mean redshift of about $\langle z\rangle = 0.6$-$0.8$ of
the faint galaxies in this magnitude interval, the cluster is quite strong and could indeed
be (marginally) critical. We identify four main features: the two local maxima (the
'first' in the lower left quadrant of the field, and the 'second' at the boundary between
the lower left and lower right quadrant), the overall increase of $\kappa_\infty$ towards the first
maximum in the lower two quadrants, and a minimum in the upper right quadrant.
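Choosing the invariance transformation such that the minimum of the map is zero can be sketched as below. The sketch assumes that the transformation rescales $1 - f\langle w\rangle\kappa_\infty$ by a constant $\lambda$, as stated in Sect. 3; the function name and the toy map values are invented for illustration.

```python
def rescale_to_zero_minimum(kappa_map, f, w_mean):
    """Apply 1 - a*kappa' = lam * (1 - a*kappa), with a = f*<w>, so that min(kappa') = 0.

    kappa_map : list of kappa_infinity values on the grid
    Returns the transformed map and the value of lam that was used.
    """
    a = f * w_mean
    kappa_min = min(kappa_map)
    lam = 1.0 / (1.0 - a * kappa_min)  # requires 1 - a*kappa_min > 0 (non-critical minimum)
    transformed = [(1.0 - lam * (1.0 - a * k)) / a for k in kappa_map]
    return transformed, lam

# Invented toy map with a slightly negative minimum.
kappa = [-0.2, 0.1, 0.5]
kappa_new, lam = rescale_to_zero_minimum(kappa, f=1.2, w_mean=0.5)
print(lam, kappa_new)
```

Any smaller $\lambda$ would also keep the map non-negative and give a larger mass, so the map normalised in this way yields a lower limit on the mass rather than an estimate.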

In Fig. 6 we show the mass density distributions corresponding to the four shear fields
presented in Fig. 3, assuming the redshift distribution (3.8) with $\beta = 1$ and $\langle z\rangle = 0.8$.
As expected from the shear fields, the mass distribution does not change dramatically.
We find an overall decrease in the mass density for the brighter galaxy samples from
$R \in (23, 25.5)$ to $R \in (21, 25.5)$. However, the faintest sample with $R \in (24, 25.5)$ gives
a mass distribution with a slightly smaller maximum than that of $R \in (23, 25.5)$. We
think that this is not significant and may be due to the fact that fewer images are used.

To investigate the stability of the reconstructed mass distribution, we repeat the
reconstruction for the same parameters $\langle z\rangle = 1$ and $\beta = 1$ as used for the reconstruction
shown in the lower left panel of Fig. 5 ($s = 0'.3$), but vary the smoothing length. The
results are shown in Fig. 7 for $s = 0'.2$ (upper left), $s = 0'.25$ (upper right), $s = 0'.35$
(lower left) and $s = 0'.4$ (lower right). We find that independent of the smoothing length
all main features can be recovered. Obviously, a smoothing length s = 0'.2 gives a too 
Fig. 5. The reconstructed surface mass density of the cluster CL 0939+4713. For the reconstruction we
use 295 galaxy images with $R \in (23, 25.5)$ and assume that their redshift distribution is given through
(3.8) with $\beta = 1$ and $\langle z\rangle = 0.6$ (upper left), $\langle z\rangle = 0.8$ (upper right), $\langle z\rangle = 1$ (lower left) or $\langle z\rangle = 1.5$
(lower right). For all these reconstructions we use a smoothing length of $s = 0'.3$ in the weight function
appearing in Eq. (2.2). Since we use no data on the upper left quadrant, and therefore cannot reconstruct
the surface mass density there, we arbitrarily set $\kappa = 0$ in this quadrant; this leads to the 'funny' shape
in the level plots and the jumps in the corresponding contour plots, which are seen throughout this
paper. The small discontinuity along the diagonal of the lower right quadrant is due to joining together
two independent reconstructions, as described in the text

noisy reconstruction, whereas $s = 0'.4$ may smooth out too much of the structure. From
visual inspection we decided to use a smoothing length of $s = 0'.3$ in the remainder
of this paper. We would like to note that a fixed smoothing length is not necessarily
the best choice; a smoothing length adapted to the local signal strength may be
more appropriate. Such a local adaptation can be objectively controlled with local $\chi^2$-statistics,
or by using regularized maximum-likelihood inversion techniques (Bartelmann
et al. 1995).

As a further check for the stability and reliability of the reconstructed mass distribution,
we perform a bootstrap analysis: we use the data set consisting of the position
vectors and ellipticities of the $N_{\rm gal} = 295$ faint background galaxies with $R \in (23, 25.5)$
Fig. 6. The reconstructed surface mass density obtained for the galaxy samples corresponding to the
shear fields shown in Fig. 3: $R \in (24, 25.5)$ (upper left), $R \in (23, 25.5)$ (upper right), $R \in (22, 25.5)$
(lower left) and $R \in (21, 25.5)$ (lower right). For all four reconstructions we assume that the redshift
distribution is given through (3.8) with $\beta = 1$ and $\langle z\rangle = 0.8$



and generate a number of synthetic data sets by drawing $N_{\rm gal}$ galaxies at a time with
replacement from the original data set. For each of the synthetic data sets we perform
the mass reconstruction. Mass distributions from three different bootstraps are shown
in the upper left, upper right and lower left panels of Fig. 8. Taking into account that in
the bootstrap analysis on average $1/e \approx 37\%$ of all galaxies are not used at all, the fact
that the main features are still recovered increases our confidence in the reconstruction.
The average mass density of 30 bootstraps, shown in the lower right panel of Fig. 8, is
very similar to the mass distribution shown on the lower left of Fig. 5, where all galaxies
and the same smoothing length $s = 0'.3$ are used. Comparing the mass reconstructions
obtained from different bootstrapping realizations, one can see that the relative variations
are considerably larger near the boundary of the data field. This is due to the fact
that a point on the boundary has fewer neighboring galaxies, and thus takes into account
less information on the local shear. We want to stress, however, that this is a 'random'
noise component, and not a systematic boundary effect.
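The resampling step of the bootstrap, and the statement that roughly $1/e$ of the galaxies are left out of each synthetic data set, can be checked with a short sketch. Integer labels stand in for the galaxy catalogue, and the seed, the number of realizations and the function name are arbitrary choices for the illustration; the mass reconstruction itself is not reproduced here.

```python
import random

def bootstrap_sample(catalogue, rng):
    """Draw len(catalogue) entries with replacement from the catalogue."""
    n = len(catalogue)
    return [catalogue[rng.randrange(n)] for _ in range(n)]

rng = random.Random(0)           # fixed seed so the sketch is reproducible
catalogue = list(range(295))     # stand-in labels for the 295 background galaxies

unused_fractions = []
for _ in range(200):
    sample = bootstrap_sample(catalogue, rng)
    unused_fractions.append(1.0 - len(set(sample)) / len(catalogue))

mean_unused = sum(unused_fractions) / len(unused_fractions)
expected_unused = (1.0 - 1.0 / len(catalogue)) ** len(catalogue)  # tends to 1/e for large N
print(mean_unused, expected_unused)
```

In a full analysis each synthetic catalogue would be passed through the same smoothing and inversion as the real data, and the scatter between the resulting maps gives the local noise level.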
Fig. 7. The reconstructed surface mass density for different values of the smoothing length $s$, with
$\langle z\rangle = 1$ and $\beta = 1$. For the upper left panel we use $s = 0'.2$, for the upper right $s = 0'.25$, for the lower
left $s = 0'.35$ and for the lower right $s = 0'.4$. Note that the main features are common to all mass
distributions shown

4.2 Correlation between mass and light 

We want to compare the reconstructed mass distribution with the light distribution of different
samples of galaxies. For this we calculate the gaussian-smoothed light distribution
via

$$L(\theta) = \frac{\sum_k 10^{-0.4\,m_k}\, \exp\left(-|\theta-\theta_k|^2/s^2\right)}{\int_U d^2\theta'\; \exp\left(-|\theta-\theta'|^2/s^2\right)} , \qquad (4.1)$$

where $U$ is the data field (i.e. the three quadrants), $\theta_k$ and $m_k$ are the positions and
magnitudes of the galaxies used, and a smoothing scale of $s = 0'.3$ is used. The denominator
in (4.1) corrects for boundary effects.

In Fig. 9a we show the light distribution of all galaxies [roughly $r \in (17, 23)$] detected
by Dressler & Gunn (1992) on a field of 4' x 4'. Comparing this with the mass distribution
shown in Figs. 5 and 6 we detect a remarkable correlation: the position of the maximum in




Fig. 8. Three mass distributions (upper left and right, lower left) resulting from different bootstrapping
realizations (see text) for $\langle z\rangle = 1$, $\beta = 1$ and $s = 0'.3$. The lower right panel shows the average of 30
bootstrapping mass distributions

the mass density corresponds reasonably well with the position of the maximum in the
light distribution, which is located approximately where Dressler & Gunn (1992)
proposed the cluster center. The secondary mass maximum corresponds to a group of
bright galaxies. It is more prominent in the light than in the mass distribution and
displaced slightly to the left relative to the position of the secondary maximum in the
mass. The minimum of the mass distribution corresponds to a region where very few
bright galaxies are observed.

Dressler et al. (1994b) studied the morphology of the bright (cluster) galaxies with
the HST (WFPC1); we show in Fig. 9b the light distribution of their identified E/S0
galaxies, tracing the old cluster galaxy population. The position of the secondary maximum
in this light distribution corresponds better with the position of the secondary
mass maximum and the correspondence with the other features is as good. Hence we
conclude that there is a correlation between the reconstructed mass distribution and the
light distribution of the bright galaxies, which are mostly cluster galaxies.

To investigate the correlation between mass and light more quantitatively, we calculate
from each mass distribution $\kappa_\infty(\theta)$ obtained in a bootstrap realization (see Sect. 4.1)
Fig. 9. (a) The gaussian-smoothed light distribution defined through Eq. (4.1) of all galaxies [roughly
with $r \in (17, 23)$] detected by Dressler & Gunn (1992) on a field of 4' x 4'. We use a smoothing length
$s = 0'.3$. (b) The gaussian-smoothed light distribution of all E/S0 galaxies identified in the field of about
2'.5 x   [WFPC1, Dressler et al. (1994b)]. The area covered by the HST (WFPC2) observations is
indicated by the solid lines



the number

$$V = \frac{1}{N} \sum_{\rm galaxies} \left[\kappa_\infty(\theta_{\rm galaxy}) - \langle\kappa_\infty\rangle\right] \qquad (4.2)$$

for different samples of $N$ galaxies, where $\langle\kappa_\infty\rangle$ is the average of $\kappa_\infty$ over our data field
$U$. If the galaxies were randomly distributed, the expectation value of $V$ would be zero,
whereas a positive (negative) correlation of galaxies with the reconstructed mass density
is indicated by $V > 0$ ($V < 0$).
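A statistic of this kind is easy to evaluate once the reconstructed map has been sampled at the galaxy positions. The sketch below assumes that $V$ is the mean excess of $\kappa_\infty$ at the galaxy positions over the field average, normalised by the number of galaxies; the exact normalisation used in the paper is not recoverable from the text, and the function name and numbers are invented.

```python
def correlation_v(kappa_at_galaxies, kappa_field_mean):
    """Mean excess of kappa at the galaxy positions over the field average.

    kappa_at_galaxies : kappa_infinity sampled at each galaxy position
    kappa_field_mean  : average of kappa_infinity over the data field
    """
    n = len(kappa_at_galaxies)
    return sum(k - kappa_field_mean for k in kappa_at_galaxies) / n

# Invented values: galaxies sitting preferentially in high-density regions ...
v_bright = correlation_v([0.30, 0.50, 0.40], kappa_field_mean=0.20)
# ... and galaxies sitting preferentially in low-density regions.
v_faint = correlation_v([0.10, 0.15, 0.05], kappa_field_mean=0.20)
print(v_bright, v_faint)
```

Repeating the calculation for each bootstrap realization of the mass map gives a distribution of $V$ for a sample, whose width shows whether a non-zero value is significant.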

From 1000 bootstrap realizations we find the distributions $p(V)$ shown in Fig. 10a
for the E/S0 galaxies identified by Dressler et al. (1994b) (DG:E/S0; dashed curve), all
galaxies detected from the WFPC2 data regardless of their size (solid curve) and all
galaxies with $R \in (24, 25.5)$ (dotted curve), also regardless of size. Clearly, a strong positive
correlation between the reconstructed mass and the DG:E/S0 galaxies is detected.
Next, a weaker positive correlation between 'all' galaxies and the mass distribution, and
an anti-correlation between the mass distribution and the faint galaxies [$R \in (24, 25.5)$],
are visible. This anti-correlation appears surprising at first sight, as certainly some of the
faint galaxies belong to the cluster, and, as we have argued above, the cluster galaxies
are positively correlated with the mass. To investigate this point further, we have calculated
the distribution of $V$ for the same magnitude interval, but leaving out the lower
left CCD where the contribution from cluster galaxies is expected to be strongest. The
resulting distribution is also plotted in Fig. 10a, indicated with an asterisk; it shows an
even stronger anti-correlation.

In Fig. 10b we show the mean correlation coefficient $\langle V\rangle$ from 1000 simulations as a
function of the magnitude range of the galaxies. We find that $\langle V\rangle$ is strongly correlated
with the faintness of the magnitude interval $(m_1, m_2)$ chosen. It decreases towards the
fainter samples and eventually becomes negative. This is due to the larger fraction of
background galaxies contributing to the counts in $(m_1, m_2)$ for fainter slices.

We now turn to a possible explanation for the anti-correlation of the faint galaxies 
with the reconstructed surface mass density: 
Fig. 10. (a): The normalized distribution $p(V)$ of the quantity $V$ defined in Eq. (4.2), calculated from reconstructed
mass distributions for 1000 bootstrap data sets drawn from the observed one ($R \in [23, 25.5]$)
with replacement. The solid curve shows the distribution for all galaxies detected from the WFPC2 observations,
the dotted curve for all galaxies with $R \in [24, 25.5]$, the long dashed curve for all galaxies
detected in the two right CCD frames with $R \in [24, 25]$ and the dashed curve for E/S0 galaxies identified
by Dressler et al. (1994b). (b): The mean correlation coefficient $\langle V\rangle$ for the different galaxy samples chosen.
The tilde indicates that the subsample has no well-determined flux threshold

The locally observed number density $n_L(>S)$ of lensed background galaxies with
flux larger than $S$ is related to the unlensed number density $n_0(>S)$ through the local
magnification $\mu$ caused by the cluster,

$$n_L(>S) = \frac{1}{\mu(\theta)}\; n_0\!\left(>\frac{S}{\mu(\theta)}\right) , \qquad (4.3)$$

where

$$\mu(\theta) = \int dz\; p_s(z)\, \frac{1}{\left|\,[1 - w(z)\kappa_\infty(\theta)]^2 - w^2(z)\,|\gamma_\infty(\theta)|^2\,\right|} \qquad (4.4)$$

is the redshift-averaged local magnification, weighted by the redshift distribution of the
galaxies. The first factor in (4.3) is due to the increase of the solid angle, whereas the
argument of $n_0$ indicates that a magnified source can be 'intrinsically' fainter by a factor
$\mu$ and still be included in a flux-limited sample. Which of the two competing processes
wins depends on the slope of $n_0(>S)$. The observed galaxy counts in the
$R$-band are well fitted by $N(R) \propto 10^{\gamma R}$ with $\gamma = 0.32$. Therefore, we obtain
from Eq. (4.3)
$$\frac{n_L(>S)}{n_0(>S)} = \mu^{2.5\gamma - 1} . \qquad (4.5)$$

The magnification (see also Broadhurst, Taylor & Peacock 1995, hereafter BTP) can be
obtained via

$$\mu = \left[\frac{n_L(>S)}{n_0(>S)}\right]^{\alpha} \quad\text{with}\quad \alpha = \frac{1}{2.5\gamma - 1} . \qquad (4.6)$$

For $\gamma = 0.32$, the exponent in Eq. (4.6) is $\alpha = -5$ and a suppression of background
galaxies is expected in regions of high magnification, or high surface mass density.
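The arithmetic behind the exponent can be made explicit. Counts with $N(R) \propto 10^{\gamma R}$ correspond to $n_0(>S) \propto S^{-2.5\gamma}$, so a magnification $\mu$ changes the counts by $\mu^{2.5\gamma-1}$ and the inversion has exponent $\alpha = 1/(2.5\gamma - 1)$. The sketch below uses hypothetical function names and an invented magnification of 2.

```python
def count_ratio(mu, gamma=0.32):
    """Ratio n_L(>S) / n_0(>S) for counts N(R) proportional to 10^(gamma * R)."""
    return mu ** (2.5 * gamma - 1.0)

def alpha_exponent(gamma=0.32):
    """Exponent alpha that converts a count ratio back into a magnification."""
    return 1.0 / (2.5 * gamma - 1.0)

def magnification_from_counts(ratio, gamma=0.32):
    """Invert the count ratio: mu = ratio ** alpha."""
    return ratio ** alpha_exponent(gamma)

ratio = count_ratio(2.0)                       # counts behind a region magnified by 2
recovered = magnification_from_counts(ratio)   # should return the input magnification
print(alpha_exponent(), ratio, recovered)
```

Because $\alpha$ is large in modulus, a modest uncertainty in the count ratio translates into a large uncertainty in $\mu$; for slopes with $2.5\gamma > 1$ the sign of the effect would reverse and counts would be enhanced instead.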

In Fig. 11 we show the gaussian-smoothed number density [defined as in (4.1) without 
fiux weighting] of the faint galaxies with R G (24,25.5). We see a local maximum 
where we detect the minimum of the mass, indicating that we found the expected anti- 
correlation. However, we also find two local maxima in the number density of the faint 
objects where we detect the maxima of the mass. Note that we have not corrected the 
faint galaxy density for occupation of some CCD area by bright (cluster) galaxies, which 
is of course strongest near the cluster center; this shows that the contribution of cluster 
members to the faint galaxy counts is slightly stronger than indicated by comparing 
Fig. 11 with Fig. 9. We thus conclude that a non-negligible fraction of the faint galaxies 
are cluster members, as also follows from Fig. 2. 
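
The smoothed number density used for Fig. 11 can be sketched as follows. Definition (4.1) is not reproduced in this section, so the kernel below, $\exp(-d^2/s^2)/(\pi s^2)$, is an assumption chosen only because it integrates to one; the function name and the example positions are hypothetical.

```python
import math


def smoothed_density(theta, positions, s=0.3):
    """Number density at theta = (x, y) in arcmin from a list of galaxy positions.

    ASSUMPTION: each galaxy contributes a kernel exp(-d^2 / s^2) / (pi s^2),
    which integrates to one over the plane; the paper's Eq. (4.1) may differ.
    """
    x, y = theta
    total = 0.0
    for gx, gy in positions:
        d2 = (x - gx) ** 2 + (y - gy) ** 2
        total += math.exp(-d2 / s ** 2)
    return total / (math.pi * s ** 2)


if __name__ == "__main__":
    galaxies = [(0.2, 0.3), (0.4, 0.1), (1.5, 1.8)]  # hypothetical positions
    print(smoothed_density((0.3, 0.2), galaxies))
```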




Fig. 11. The Gaussian-smoothed number density of the faint galaxies with $R \in (24, 25.5)$. We use a 
smoothing length $s = 0'.3$ 



Assuming that the cluster galaxies have an average correlation coefficient $\langle V\rangle_C > 0$ 
with the mass distribution, independent of the magnitude of the galaxies, and that the 
background galaxies have an average correlation coefficient $\langle V\rangle_B < 0$, again independent 
of their magnitudes, one can then derive the fraction $x$ of cluster galaxies in a magnitude 
slice $(m_1, m_2)$ by measuring the average correlation coefficient $\langle V\rangle$ of the galaxies in that slice. 

Using $\langle V\rangle_C$ from the galaxies with $R \in (18, 22)$ and $\langle V\rangle_B$ from the galaxies 
with $R \in (24, 25.5)$* we estimate the fraction of cluster galaxies to be 83% for the 'DG 
E/S0' sample, 76% for the DG sample, 60% for the galaxies with $R \in (18, 24)$, 11% for 
$R \in (23, 25.5)$ and 5% for $R \in (24, 25.5)$. Of course these values are crude estimates only, 
but they do not appear unreasonable. 
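
One way to read this estimate is as a two-component mixture. If a slice contains a fraction $x$ of cluster galaxies and $1-x$ of background galaxies, and the correlation coefficients combine linearly, then $\langle V\rangle = x\,\langle V\rangle_C + (1-x)\,\langle V\rangle_B$. The linear mixing is an assumption made here for illustration; the function name and the input values below are hypothetical and are not measurements from the paper.

```python
def cluster_fraction(v_slice, v_cluster, v_background):
    """Cluster fraction x in a magnitude slice, assuming linear mixing:

        v_slice = x * v_cluster + (1 - x) * v_background
    """
    if v_cluster == v_background:
        raise ValueError("cluster and background coefficients must differ")
    return (v_slice - v_background) / (v_cluster - v_background)


if __name__ == "__main__":
    # Hypothetical coefficients, chosen only to exercise the formula.
    print(cluster_fraction(v_slice=0.2, v_cluster=0.5, v_background=-0.1))
```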
To summarize this subsection, the correlation of bright galaxies with the reconstructed 
surface mass density shows that in this cluster 'mass follows light' on average. Hence, 
overdensities of bright galaxies correspond to local maxima in the projected mass density. 
The anti-correlation of faint galaxies with the reconstructed mass profile, which is also 
significant, is most likely an effect of the magnification (anti)bias, which has been pointed out 
by Broadhurst, Taylor & Peacock (1995) and which was detected in the cluster A1689 
(Broadhurst 1995). 

4.3 Limits on the mass inside the data field 

Requiring that the surface mass density cannot be negative, one can obtain a lower limit 
on the mass inside the data field by applying the invariance transformation (3.5) such that 
the minimum of the resulting $\kappa_\infty$-map is zero. Using the galaxies with $R \in (23, 25.5)$, we 
find as a lower bound on the total mass inside the data field (side length about $1\,h_{50}^{-1}$ Mpc) 
$M/(10^{14} h_{50}^{-1} M_\odot) > 6.3$ (4.3, 3.6, 2.8) for a mean redshift $\langle z\rangle = 0.6$ (0.8, 1.0, 1.5). 
These limits depend only slightly on the actual form of the assumed redshift distribution, 
as shown in Fig. 12. 
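
The idea of fixing the free parameter by demanding a non-negative map can be sketched with the classical mass-sheet transformation $\kappa \to \lambda\kappa + (1-\lambda)$, which holds for sources at a single redshift. This is a stand-in, not the paper's transformation (3.5), which generalizes it to a redshift distribution and is not reproduced in this section. The function name and the toy map are hypothetical.

```python
def minimal_kappa_map(kappa):
    """Rescale a convergence map so that its minimum is exactly zero.

    Uses the classical mass-sheet transformation
        kappa' = lam * kappa + (1 - lam),
    with lam = 1 / (1 - min(kappa)), so that min(kappa') = 0.
    Returns (lam, transformed_map).
    """
    k_min = min(kappa)
    if k_min >= 1.0:
        raise ValueError("minimum convergence must be below 1")
    lam = 1.0 / (1.0 - k_min)
    return lam, [lam * k + (1.0 - lam) for k in kappa]


if __name__ == "__main__":
    toy_map = [-0.2, 0.1, 0.4]  # hypothetical pixel values
    lam, shifted = minimal_kappa_map(toy_map)
    print(lam, shifted)
```

The mean of the transformed map then gives the smallest total mass compatible with a non-negative surface mass density, up to the critical surface density and the field area.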
Fig. 12. Lower limits $M_{\rm min}$ on the total mass inside the data field in units of $M_{14} = 10^{14} h_{50}^{-1} M_\odot$ as 
a function of the assumed mean redshift $\langle z\rangle$ (left) or the mean $\langle w\rangle$ (right) of the images used for the 
reconstruction, assuming the redshift distribution (3.8). Crosses show the results for $\beta = 1$, triangles for 
$\beta = 1.5$ and dashes for $\beta = 3$. The smoothing length is $s = 0'.3$ 

The most conservative upper limit on the mass we can give is $M < 10^{15} h_{50}^{-1} M_\odot$, 
because this corresponds to $\langle\kappa_\infty\rangle = 1$ and would most probably produce several giant 
arcs, which are not observed. 

Using Eq. (4.6), one can in principle derive the local magnification $\mu(\theta)$ and could 
therefore break the global invariance transformation (3.5) with the measurement of $\mu(\theta)$ 
at one particular point in the cluster. However, in practice we have the following 
difficulties: (1) since lensing effects are nowhere weak on the whole field, we have no measurement 
of $n_0(>S)$; (2) we have no colour information on the galaxies and therefore cannot 
distinguish well between faint cluster galaxies and lensed background galaxies; (3) possible 
clustering of background objects and confusion with cluster member galaxies lead to 
a high noise in the estimate of $\mu(\theta)$ and a poor resolution. Therefore, we cannot derive 
an accurate high-resolution map of the magnification from Eq. (4.6). 

Nevertheless, we can use the magnification (anti)bias to derive (crude) estimates 
of the total mass inside the data field. Crucial for this is the assumption that the 
unlensed number counts $n_0(>S)$ can be taken from the literature. We use the amplitude 
and slope given in Small et al. (1995b), which gives for the data field $U$ and galaxies 
with $R \in (23, 25.5)$ $N_0(23, 25.5) = 278$. The observed number $N^{\rm obs}(23, 25.5)$ 
is a sum of cluster galaxies (CG) and galaxies belonging to the faint 
galaxy field population (FG). Now, we assume that a fraction $x$ of the observed galaxies 
$N^{\rm obs}$ are cluster galaxies and obtain 

$$ \left\langle \mu^{2.5\gamma-1} \right\rangle_U = \frac{N_{\rm FG}}{N_0} = \frac{(1-x)\,N^{\rm obs}}{N_0} . \qquad (4.7) $$

For $x = 0.15$ (0.2) we obtain $\langle\mu^{2.5\gamma-1}\rangle_U = 0.849$ (0.796). We perform the reconstruction 
assuming a value of $\lambda$ or, equivalently, a value of $\langle\kappa_\infty\rangle$, and calculate from the resulting 
mass map $\kappa_\infty$ and the shear map $\gamma_\infty$ locally the expectation value of the magnification 
(4.4) of the sources, and finally average $\mu^{2.5\gamma-1}$ over the field $U$ to obtain $\langle\mu^{2.5\gamma-1}\rangle_U$. Next, 
we search for that value of $\langle\kappa_\infty\rangle$ which gives the value $\langle\mu^{2.5\gamma-1}\rangle_U$ corresponding to a 
certain fraction $x$ of cluster galaxies. In Fig. 13 we show the mass estimates for $x = 0.15$ 
(solid curve) and $x = 0.2$ (dotted curve) as a function of the assumed mean redshift of 
the galaxies used for the reconstruction. For comparison we show again the minimum 
mass (dashed curve) as shown in Fig. 12 for $\beta = 1$. From these lower limits on the mass 
we conclude that the fraction of cluster galaxies in the galaxy sample with $R \in (23, 25.5)$ 
is $x \gtrsim 0.1$. 
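
The search described above can be sketched as a one-dimensional root find. The sketch makes two simplifying assumptions that differ from the paper: all sources sit at a single redshift, so that $\mu = 1/|(1-\kappa)^2 - |\gamma|^2|$, and the classical mass-sheet transformation $\kappa \to \lambda\kappa + (1-\lambda)$, $\gamma \to \lambda\gamma$ stands in for the transformation (3.5). Function names and the toy maps are hypothetical.

```python
def mean_count_ratio(lam, kappa, shear, gamma=0.32):
    """Field average of mu**(2.5*gamma - 1) after the mass-sheet transformation.

    ASSUMPTION: single source redshift, so mu = 1 / |(1 - kappa)^2 - shear^2|.
    """
    power = 2.5 * gamma - 1.0
    total = 0.0
    for k, g in zip(kappa, shear):
        k_new = lam * k + (1.0 - lam)
        g_new = lam * g
        mu = 1.0 / abs((1.0 - k_new) ** 2 - g_new ** 2)
        total += mu ** power
    return total / len(kappa)


def solve_lambda(target, kappa, shear, lo=0.05, hi=2.0, tol=1e-10):
    """Bisection for the lam whose field-averaged count ratio equals target."""
    f_lo = mean_count_ratio(lo, kappa, shear) - target
    f_hi = mean_count_ratio(hi, kappa, shear) - target
    if f_lo * f_hi > 0.0:
        raise ValueError("target is not bracketed by [lo, hi]")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = mean_count_ratio(mid, kappa, shear) - target
        if f_lo * f_mid > 0.0:
            lo = mid  # root lies in the upper half; sign of f_lo is unchanged
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    toy_kappa = [0.1, 0.3, 0.5]   # hypothetical convergence pixels
    toy_shear = [0.05, 0.1, 0.2]  # hypothetical shear amplitudes
    print(solve_lambda(0.849, toy_kappa, toy_shear))
```

In this simplified setting the magnification scales as $\mu \to \mu/\lambda^2$, so the averaged count ratio varies monotonically with $\lambda$ and bisection is sufficient.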

We note that we recently became aware of a paper by Belloni et al. (1995) which 
could allow for a better separation between cluster and background galaxies for the 
galaxies with $R \in (23, 25.5)$: Belloni et al. used multiband photometry to get the 
redshifts of 275 bright galaxies with $R < 22.5$, i.e., the fraction of cluster galaxies in 
the sample of galaxies with $R < 22.5$. From that and the known slope of the faint field 
galaxies, we can probably derive a better estimate for the fraction of cluster galaxies in 
the faint galaxy sample with $R \in (23, 25.5)$. 



4.4 The mass to light ratio 

We calculate the total light of all galaxies inside the field $U$ detected by Dressler et al. 
(1994b), leaving aside those galaxies whose measured redshifts exclude cluster 
membership. Magnitudes for these galaxies are given in Gunn $r$ ($\lambda = 655$ nm), which corresponds 
to a rest-frame wavelength of $\lambda \approx 464$ nm, i.e. the measured $r$ magnitudes correspond 
to the $B$ ($\lambda = 443$ nm) magnitudes in the rest frame. As a result we obtain for this 
sample a total luminosity of $(L/L_\odot)_B = 5 \times \ldots$. From this and the mass estimates 
shown in Fig. 13 we derive the $M/L$ values shown in Fig. 14. If the Dressler et al. (1994b) 
sample of galaxies with $r \in (17, 23)$ represents the luminosity of cluster galaxies well and 
Fig. 13. The mass $M$ inside the data field in units of $M_{14} = 10^{14} h_{50}^{-1} M_\odot$ as a function of the assumed 
mean redshift $\langle z\rangle$ (left) or the mean $\langle w\rangle$ (right) of the images used for the reconstruction, assuming the 
redshift distribution (3.8) with $\beta = 1$. The solid curve shows the total mass in the field $U$ assuming that 
$x = 15\%$ of all galaxies detected within $R \in (23, 25.5)$ are cluster galaxies, the dotted curve shows the 
result for $x = 20\%$. For comparison we show $M_{\rm min}$ from Fig. 12 (dashed curve) 



if the mean redshift of the faint galaxies [$R \in (23, 25.5)$] used for the mass reconstruction 
is $\langle z\rangle = 0.8$, then we derive from the non-negativity of the surface mass density a lower 
limit on M/L ^ QSh^o (dashed curve in Fig. 14) for the cluster CL0939+4713, and the 
values $M/L \approx 102\,h_{50}$ for a fraction of $x = 0.15$ of cluster galaxies in the faint galaxy 
sample (solid curve) and $M/L \approx 142\,h_{50}$ for $x = 0.2$ (dotted curve). From the absence 
of giant luminous arcs we derive, independently of the assumed redshift of the sources 
or the fraction of cluster galaxies, a robust upper limit $M/L \lesssim 200\,h_{50}$. Of course, the 
sample chosen for calculating the luminosity of the cluster includes a certain fraction 
of background and foreground galaxies, but this is partly compensated because we will 
miss a certain fraction of faint cluster galaxies. To derive a conservative upper limit on 
the total luminosity of the cluster, or a conservative lower limit on $M/L$, we calculate 
the total luminosity of all galaxies detected in the field (down to $R = 26.5$) and obtain 
almost twice the value $(L/L_\odot)_B$ found above, or half the values for $M/L$ shown in 
Fig. 14. 
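
The arithmetic behind the band conversion and the mass-to-light ratios can be sketched as follows. The exponent of the total luminosity is not legible in this section, so the value $5 \times 10^{12}$ solar luminosities used below is an assumption, adopted only because it is roughly consistent with the quoted ratios; the mass of $5 \times 10^{14}$ solar masses is the approximate value quoted in the abstract. Function names are hypothetical.

```python
Z_CLUSTER = 0.41  # cluster redshift z_d quoted in the text


def rest_frame_wavelength(observed_nm, z=Z_CLUSTER):
    """Rest-frame wavelength in nm of light observed at observed_nm."""
    return observed_nm / (1.0 + z)


def mass_to_light(mass_in_1e14, luminosity_in_1e12):
    """M/L in solar units, for M in 10^14 solar masses and L in 10^12 solar luminosities."""
    return 100.0 * mass_in_1e14 / luminosity_in_1e12


if __name__ == "__main__":
    ASSUMED_L = 5.0  # ASSUMPTION: exponent 12 is not legible in the source
    print(f"Gunn r in the rest frame: {rest_frame_wavelength(655.0):.1f} nm")
    print(f"M/L for M = 5e14: {mass_to_light(5.0, ASSUMED_L):.0f}")
    print(f"M/L for M = 1e15: {mass_to_light(10.0, ASSUMED_L):.0f}")
```

Under this assumed luminosity, the upper mass limit of $10^{15}$ solar masses maps onto the quoted upper limit $M/L \lesssim 200\,h_{50}$.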

The fact that the $M/L$ values (see Fig. 14) are small compared to the $M/L$ found 
for other clusters from a weak lensing analysis (see Fahlman et al. (1994), or Small et 
al. (1995a)) is not too surprising, taking into account that CL0939+4713 is an optically 
selected (Abell) cluster at the high redshift $z_d = 0.41$. A moderately bright cluster at this 
redshift would probably not have entered the Abell catalog. In addition, the $M/L$ ratio 
quoted here is uncorrected for cosmic evolution, which can be quite substantial from 
$z = 0.4$ to today in the B-band, so that the corresponding $M/L$ value 'today' would be 
Fig. 14. The values of $(M/L)_B$ in units of $h_{50}(M_\odot/L_\odot)_B$ obtained for the masses shown in Fig. 13. 
To calculate the luminosity $L$ we used all galaxies identified by Dressler & Gunn (1992) in that field, 
excluding those galaxies for which a measured redshift shows that they are not cluster members 



considerably higher. 



4.5 The ROSAT PSPC image 



In Fig. 15 we show the ROSAT PSPC image of the cluster obtained from a Gaussian 
smoothing of the counts with a smoothing length of $s = 0'.2$. The center of the coordinate frame 
coincides with the lower left corner of the HST image shown in Fig. 1. To align the PSPC 
image with the HST image we used a star. Thus, the alignment should be better than 
one PSPC pixel (pixel size 15"). The PSPC image shows the main X-ray emission around 
the cluster center, which roughly corresponds to the region where we derive a maximum 
of the light and a maximum of the mass distribution. The PSPC data are analysed in 
a forthcoming paper by S. Schindler. A more detailed comparison 
between the X-ray and mass distributions has to await the ROSAT HRI image. 
Fig. 15. The PSPC image of the cluster CL0939+4713. The contour lines show the photon counts, 
ranging from 9.5 down to 1.5 with a spacing of 1. North is at the bottom and east to the right 



5 Discussion 

Using deep WFPC2 data, we have reconstructed the projected (dark) matter distribution 
of the cluster CL0939+4713. The distortion of faint background galaxies was used to 
construct a 'shear map' of the cluster, from which an unbiased, nonlinear estimate of 
the surface mass density was constructed. The resulting mass map is defined up to an 
overall invariance transformation, a generalization of the so-called mass sheet degeneracy. 
The mass distribution is strongly correlated with the projected distribution of the bright 
cluster galaxies; in particular, the maximum of the mass map coincides with the cluster 
center as determined from the light distribution, a secondary maximum of the map 
corresponds to a concentration of cluster galaxies, and a deep mass minimum occurs 
where the number density of cluster galaxies is lowest. We also note that the main mass 
(and light) maximum corresponds to a maximum in the X-ray emission, as seen with the 
ROSAT PSPC. The anti-correlation of mass with faint galaxies is interpreted as the 
difference between a positive correlation of mass with faint cluster galaxies (mainly seen 
towards the cluster center), and a magnification anti-bias (BTP), which is expected due 
to the flatness of the galaxy number counts. 

Our analysis shows that the recently developed cluster inversion techniques can be 
applied to WFPC2 data, provided they are deep enough to image faint galaxies precisely, 
despite the fact that the WFPC2 field of view is fairly limited. It is essential to use an unbiased 
finite-field inversion technique in this case, and also, since the cluster center is (nearly) 
critical, to account for strong lensing effects. Also, owing to the fairly large redshift of 
the lensing cluster, the redshift distribution of the background galaxies has to be taken 
into account explicitly; only in the weak lensing regime does this distribution not enter the 
reconstruction, but only the mean of the distance ratio $D_{ds}/D_s$. 
We have checked the robustness of the mass reconstruction by using different 
magnitude cuts for the galaxies and by extended bootstrap simulations. The main features 
of the mass map (the two mass maxima, the pronounced minimum, and the overall 
gradient toward the cluster center) are stable. The anti-correlation of mass with the faint 
galaxies, and the strong correlation with cluster galaxies, further increase our confidence 
in the reconstruction. We have derived estimates of the cluster mass contained within 
the WFC aperture, which depend on the assumed redshift of the background galaxies. 
A robust lower limit on the mass follows from the non-negativity of the surface mass 
density; a robust upper limit comes from the absence of giant luminous arcs. To derive a 
narrower mass range, one needs to fix the parameter $\lambda$ contained in the invariance 
transformation (3.5). This can be done by using the magnification anti-bias (BTP). In the 
only case where this effect has been demonstrated before (A1689, Broadhurst 1995), a 
color criterion was used to ensure that the galaxies are likely background galaxies. Since 
we lack color information, the faint galaxies which are cluster members 
or foreground galaxies cannot be separated from background galaxies. Nevertheless, a 
plausible range for $\lambda$ can be obtained, which led to the mass estimates shown in Fig. 13. 
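
The bootstrap check mentioned above can be illustrated in a generic form. The sketch resamples a galaxy catalogue with replacement and measures the scatter of a statistic across the resamples. The statistic used here (a simple mean) is a stand-in for the full mass reconstruction, whose procedure is not described in this section; the function name and the sample values are hypothetical.

```python
import random


def bootstrap_std(values, statistic, n_resamples=200, seed=0):
    """Standard deviation of a statistic over bootstrap resamples of values."""
    rng = random.Random(seed)
    n = len(values)
    estimates = []
    for _ in range(n_resamples):
        # Draw n entries with replacement from the original catalogue.
        sample = [values[rng.randrange(n)] for _ in range(n)]
        estimates.append(statistic(sample))
    mean = sum(estimates) / n_resamples
    variance = sum((e - mean) ** 2 for e in estimates) / (n_resamples - 1)
    return variance ** 0.5


if __name__ == "__main__":
    ellipticities = [0.12, -0.05, 0.30, 0.08, -0.14, 0.21]  # hypothetical values
    print(bootstrap_std(ellipticities, lambda s: sum(s) / len(s)))
```

Features of the mass map that persist across such resamples, as the text reports for the two maxima and the minimum, are unlikely to be driven by a few individual galaxy images.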

The exploration of this novel method to reconstruct the density distribution of 
clusters has only just begun. In contrast to current ground-based data, for which image 
ellipticities have to be substantially corrected for seeing effects, WFPC2 data provide 
relatively 'clean' probes of image ellipticities. The small field of view of WFPC2 limits 
the extent to which clusters can be mapped (especially for nearby clusters), unless mosaics 
are taken, but high-resolution mass maps such as the one constructed here are invaluable 
tools for investigating substructure in cluster mass distributions and their relation to 
substructure in the distribution of galaxies and X-ray emission. 